Learning Video-Story Composition via Recurrent Neural Network

نویسندگان

  • Guangyu Zhong
  • Yi-Hsuan Tsai
  • Sifei Liu
  • Zhixun Su
  • Ming-Hsuan Yang
چکیده

In this paper, we propose a learning-based method to compose a video-story from a group of video clips that describe an activity or experience. We learn the coherence between video clips from real videos via the Recurrent Neural Network (RNN) that jointly incorporates the spatial-temporal semantics and motion dynamics to generate smooth and relevant compositions. We further rearrange the results generated by the RNN to make the overall video-story compatible with the storyline structure via a submodular ranking optimization process. Experimental results on the video-story dataset show that the proposed algorithm outperforms the state-of-the-art approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Input-driven unsupervised learning in recurrent neural networks

Porto, the institutional repository of the Politecnico di Torino, is provided by the University Library and the IT-Services. The aim is to enable open access to all the world. Please share with us how this access benefits you. Your story matters.

متن کامل

Prediction of the Effect of Polymer Membrane Composition in a Dry Air Humidification Process via Neural Network Modeling

Utilization of membrane humidifiers is one of the methods commonly used to humidify reactant gases in polymer electrolyte membrane fuel cells (PEMFC). In this study, polymeric porous membranes with different compositions were prepared to be used in a membrane humidifier module and were employed in a humidification test. Three different neural network models were developed to investigate several...

متن کامل

Optimal Structure of Pipelined Recurrent Neural Network in Modeling of MPEG Video

This paper investigates optimal structure of Piplined Recurrent Neural Network (PRNN) for adaptive traffic prediction of MPEG video signal via dynamic ATM networks. The traffic signal of each picture type (I, P, and B) of MPEG video is characterized by a nonlinear autoregressive moving average (NARMA) process. Since those modules of PRNN can be performed simultaneously in a pipelined parallelis...

متن کامل

Learning Visual Storylines with Skipping Recurrent Neural Networks

What does a typical visit to Paris look like? Do people first take photos of the Louvre and then the Eiffel Tower? Can we visually model a temporal event like “Paris Vacation” using current frameworks? In this paper, we explore how we can automatically learn the temporal aspects, or storylines of visual concepts from web data. Previous attempts focus on consecutive image-to-image transitions an...

متن کامل

A Critical Review of Recurrent Neural Networks for Sequence Learning

Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video analysis, and musical information retrieval, a model must learn from inputs that are sequences. Interactive tasks, such as translating natural language, engagin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1801.10281  شماره 

صفحات  -

تاریخ انتشار 2018